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DATABASE SEARCHING METHOD AND SYSTEM 

The present invention relates to a method and system 
for searching a plurality of information databases. 
5 Databases are well - known- and widely used- for the 
organized storage of information. Depending upon the 
application in question, in many cases there is a great 
demand for the provision of searching methods to enable the 
stored information to be selectively accessed by a user. 

10 For this reason, a great deal of investment is often made 
in the production, updating and on-going development of 
such databases. The provision of improved searching 
methods forms part of this development. 

In fields of particular scientific or commercial 

15 interest there often exist a number of databases providing 
related and/or overlapping information. These databases 
might rs^vi":^ directly from different competing database 
suppl.V.-. - /^ example, due to the independent generation 

and cataloging of scientific information. 

2 0 One particular example of the use of numerous 

databases is in the field of biomedical science. The 
biomedical domain is a multi-disciplinary domain 
encompassing all areas of biology and medicine. There is 
a large and ever increasing volume of electronic biomedical 

2 5 information present upon a number of databases, which are 

individually dedicated to particular fields within the 
biomedical discipline . 

Access to such information in cases such as these is 
unfortunately frustrated by the large number of disparate 

3 0 data sources and the lack of a standard nomenclature being 

used between them. 

Although a multitude of nomenclature or classification 
systems exist, there is a lack of consistency relating to 
their architecture and content. This hinders the ease with 
3 5 which the databases can be accessed. The content can also 
be variable between such databases as expertly annotated 
versions tend to have- narrow discipline -related 
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perspectives, do not- cover historical terms and indeed are 
not contemporaneous . 

As a result, database users tend to focus their 
investigations upon single databases with which they are 
5 familiar. This has associated disadvantages in that 
information which is highly relevant to the user may be 
present upon one or more databases covering overlapping or 
related fields but this information will not become known 
to the user. 

10 One of the main problems in such interrelated 

disciplines is that particular terms used in one discipline 
may not be identical to those used in a different 
discipline and therefore automatic computer-based searching 
is severely limited. Furthermore, the arrangement of the 

15 information within such databases is generally unique to 
the database in question. The performance of a search upon 
multiple databases of this kind therefore often requires 
labourious' searching on specific individual databases with 
a detailed knowledge of each subject being needed in order 

2 0 to perform a high quality search. 

• There is therefore a need to provide an improved 
searching method to enable searching across multiple 
databases . 

In accordance with a first aspect of the present 
25 invention we provide a method of searching a plurality of 
information databases for records related to an input 
search term, comprising :- 

selecting a group of related search terms containing 
the input search term, . from a search database of terms 

3 0 arranged in predefined groups according to their 

relationship with one another, wherein each term is present 
within one or more of the information databases; and, 

searching for terms from the selected group within a 
data repository comprising selected data previously 
3 5 extracted from the records of each information database, to 
identify the corresponding records within the information 



databases which contain the terms within the selected 
group . 

The present invention overcomes many of the problems 
associated with searching a plurality of information 
databases, "in that groups of related search terms are used 
to search upon the various databases provided. These 
groups are predefined and, when an inputted search term is 
provided by a user, the search database is queried in order 
to select the one or more groups containing this inputted 
search term. In particular, this allows dissimilar terms 
having identical or similar meanings, to be searched -upon 
the plurality of information databases. This greatly 
improves the power of the searching technique and directly 
allows extension of searching beyond a single database to 
multiple • databases. The speed of multiple database 
searching is therefore improved as a result. 

The method particularly benefits normal use^-SlvVc^ 
familiar with only a single discipline, i:v 
provision of searching across multiple discij;V.^i^^ is 
provided without a detailed knowledge of these 1 other 
disciplines being required. 

The present invention is not limited to any particular 
types of information databases nor to the subject matter of 
their contents. However, the invention is particularly 
advantageous for use in cases where a number of large and 
complex information databases are provided, each providing 
related or overlapping information. This is notably the 
case in the biomedical field. 

The present invention also recognises the .problem 
that, for many databases, searching for information within 
more than one database may increase the amount of processor 
time required for searching. This is addressed by 
previously extracting selected data from the various 
information databases and storing it in a dedicated data 
repository. Only selected data is normally needed for 
search purposes, because with most types of search it is 
not necessary to search through all data contained within 
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each record of the information databases. .One example of 
this is in the searching of a biotechnology database in 
which lengthy gene sequences are provided but the searching 
of these actual sequences is not required. The presence of 
5 such sequences represents a large amount of redundant data 
insofar as a search is concerned which is related to the 
causes of disease. 

It is therefore advantageous to extract data from the 
records of such information databases and to store the data 
10 separately in a data repository such that the speed and 
efficiency with which the data, may be searched can be 
improved. 

The data repository is preferably arranged as a number 
of records, with a repository record corresponding to a 

15 record present within one of the information databases. 
There is therefore preferably a direct correspondence 
between the number of indi^"U v':^ds in the information 

databases and the number : 1 sr-^-'i 'dual records in the 
repository." Each record x , repository preferably 

20 further comprises a pointer identifying the specific record 
in the information database to which it relates. This is 
used to allow access by a user to the full record when 
required. 

In the case of a direct correspondence of records 
25 between the repository and databases, this access may be 
achieved by simply using identical record identifiers (such 
as accession numbers) . However in cases of non-direct 
correspondence, a specific and separate pointer to the 
particular record is used. 
30 Due to the extraction of the data from the information 

databases, typically the amount of selected data in the 
repository is less than that contained in the information 
databases . The degree to which the former amount is 
smaller is dependent upon the particular type of record 
3 5 used and the fields which are desired to be searched within 
each record. 
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In general, the data in the repository comprises 
definitional and/or semantic data. The definitional data 
preferably describes data in terms of its nature, use or 
value whereas the semantic data preferably describes 
5 alternative terms for the 'data in the information 
databases. Generally, the semantic data • describes 

synonymous terms in the information databases. 

Within the search database, each term preferably has 
corresponding* metadata indicating the one or more 
10 information databases within which the particular term is 
contained. This information can be used to reduce needless 
searching upon databases where it is known that no such 
term is present. This therefore increases the search speed 
during use. Such metadata also preferably indicates the 
1-5 -one •■ or more- fields of the information database (s) .within, 
which it is contained as it will be recognised that each 
information. Z<w :v-?*ye. generally has a unique format. 

" Pretv- : ■ . v ; v~- terms in the predefined groups are- 

arranged the search database such that the 

/j_ * • 

20 predefined groaps are formed from synonymous terms. Each 
group is also typically provided with a unique group 
identifier. 

Due to the possibility that an inputted search term 
may be found within more than one group, the method 
25 preferably further comprises determining the context of the 
records retrieved using the inputted search term (and 
associated group of terms) . Following identifying the 
groups in which the term is present, when the repository is 
searched the context of each record may be determined 
3 0 during the search itself (to limit the number of records 
. returned) or later following the selection of all records 
containing any terms in the group. 

The context may be determined based upon the field 
type of the repository record in which the term is found 
35' such as a "domain" . Alternatively, or additionally, the 
context may be determined by searching for the presence of 
• one or more of the other terms, within the group, in the 
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same field or record of the repository. This allows 
automatic selection of the correct search subject. 

in general, the method according to the first aspect 
of the invention is performed by a computer program 
5 • comprising suitable computer program code means . Such a 
computer program may be retained upon a computer readable 
medium. 

In accordance with the second aspect of the present 
invention, we provide a database searching system for 
10 searching a plurality of information databases for records 
related to an inputted search term, the system comprising: - 
a search database comprising related search terms 
arranged into predefined groups according to their 
relationship to one another, wherein each term is present 
1.5 within one or more of the information databases;. 

selection means, for selecting a group containing the 
-■nputted search term from the search database; 

a data repository comprising selected data previously 
extracted from the records of each information database; 
20 and, 

searching means for searching the repository for terms 
from the selected group to identify the corresponding 
records within the information databases which contain the 
terms within the selected group. 

25 Preferably the search term is provided to the system 

using an input means which may take the form of a local 
input device, or alternatively a communication network such 
as the internet. The use of a communication network allows 
users to access the system from remote locations. The 

30 system may also comprise the information databases 
themselves, although typically these are also located 
remotely from the data repository. The selection and 
searching means are typically provided as a combined query 
system upon a computer. This computer may also contain 

35 either or both of the data repository and the search 
database . 



An example of a multiple database search method and 
system according to the present invention will now be 
described, with reference to the accompanying drawings, in 
which: - 

Figure i Is a schematic representation of the search- 
system; and 

Figure 2 is a flow diagram of a method of searching 
using the search system. 

A multiple database system relating to the field of 
biomedical science is generally indicated at 1 in Figure 1. 

A number of individual proprietary information databases 

» 

are indicated at 2,3 and 4. Examples of these databases 
include "Genbank" (National Centre For Biotechnology 
Information) , "Swissprot" (European Bioinf ormatics 
Institute) , "OMIM" (National Centre For Biotechnology 
Information) and "UMLS" (National Library Of Medicine) . In 
this example, three information databases are 
relating to gene sequences and genetic disorders.:. 

A data repository 5 is arranged in communication : / 
each of the information databases 2,3,4. The data 
repository 5 is organised as a database, stored on a local 
computer server. The information databases 2,3,4 are 
stored upon remote servers and accessed by the data 
repository 5 using a suitable network such as the Internet. 
A query system 6 is arranged to access the data repository 
5 and is implemented by suitable software running upon a 
local computer (which may be the server upon which the data 
repository 5 is stored) . 

A separate search database 7 is also provided oh the 
query system computer and this is arranged to be accessed 
by the query system 6. An input means 8 is provided to 
allow a user of the system to access the query system 6 
In the present example, the input means 8 is a remote 
computer connected via a communication network such as the 
Internet, to the query system 6. Alternatively, it could 
be a local input device such. as a keyboard . attached to- the 
query system computer. 
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.Regarding the information databases 2,3,4 these are 
generally arranged as a large number of records, with each 
record corresponding to a particular entity. In the case 
of the Genbank database, the records are arranged according 
to individual gene sequences . Each record contains a large 
number of fields. Examples of these for the Genbank 
information database include: LOCUS / DEFINITION, ACCESSION, 
VERSION, KEYWORDS , SEGMENT, SOURCE, ORGANISM, REFERENCE, 
AUTHORS, TITLE, JOURNAL. A large amount of data is 
therefore provided in each record and not all of thxs xs 
useful for searches of the type provided by the system of 
this example. 

The data repository 5 provides a copy of each record 
within each of the information databases 2,3,4 and 
therefore mirrors the -content of these databases. However, . 
for each record, only data within selected fields xs 
retained within the data repository therefore .rec^s 

within the data repository cont:,3,> „b ^.eially less data 
than that provided within the F-cord upon the 

respective information databases. As to which fields are 
copied into the data repository 5, this is determined by 
the administrator of the system 1 and is dependent upon the 
type of searching services which are to be provided to a 

user. , _ «» • 

Table 1 shows part of a record withxn the data 
repository 5 relating to the Genbank record for the HTR2B 
gene (AF156159) . 
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TABLE 1 



Extracted Term 


Genbank Field 


Meta-Data Type 


Meta-Data 
Field 


HSHTR2B2 


LOCUS 


definitional/semantic 


SYNONYM 


DNA 


LOCUS 


definitional 


DOMAIN 


21-APR-2000 


LOCUS 


definitional 


ENTRY DATE 


HTR2B 


DEFINITION . 


semantic 


SYNONYM 
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Homo sapiens 5- 
hydroxytryptamine 2B receptor 
(HTR2B) gene, exon 2. 


DEFINITION ' 


definitional 


DEFINTION 


AF156159 


ACCESSION 


definitional/semantic 


SYNONYM 


Homo sapiens 


ORGANISM 


definitional 


SPECIES 


HTR2B 


FEATURES / 
mRNA/ gene 


semantic 


SYNONYM 


5-hydroxytryptamine 2B 
receptor 


FEATURES / 
mRNA / product 


semantic 


SYNONYM 


HTR2B 


FEATURES / 
gene / gene 


semantic 


SYNONYM 


HTR2B 


FEATURES / 
CDS / gene 


semantic 


SYNONYM 


5-hydroxytryptamine 2B 
receptor 


FEATURES / 
CDS / product 


semantic 


SYNONYM 



15 In addit-'*-' — K /^:e "Extracted term" data and €fie 

•JGenbank f 1g7,> ■ ..cVo-.- • ... v.vz-tracted from Genbank and ret.aiiied 
in the respective, - elumns, the "Meta-Data Type" and "Meta- 
Data Field" columns of Table 1 provide additional 
information defining the type of data which is contained in 

20 the respective field. This is described as "meta-data" 
because data in these fields describes the data obtained 
from the information databases 2,3,4. Two types of meta- 
data are used in this example system, these being 
"definitional" and "semantic". 

25 Definitional meta-data is information that is used to 

uniquely describe and/or categorise data in terms of its 
nature, use, value and encumbrances. Semantic meta-data 
provides alternative terms for data such as synonyms or 
cross-references. Semantic meta-data is used to infer 

30 equality in meaning between data from the information 
databases 2,3,4. These two types of meta-data are not 
exclusive and therefore meta-data can be both descriptive 
and semantic. For example a gene name for a data record 
may be both definitional arid semantic meta-data. 
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The "Meta-data type- column shows the kind of meta- 
data to which each extracted field relates and the "Meta- 
data Field" column defines a corresponding meta-data field 
for searching purposes, It can be seen in this latter case 
that a number, of the fields from- the information databases 
are assigned to the same meta-data field, namely " SYNONYM" . 

in this particular record, the term "DNA" from this 
record is assigned to the 11 DOMAIN " meta-data field. The 
use of domains is described in more detail later. 

Each record within the repository 5 also has 
associated meta-data in the form of a "pointer" which, 
identifies the database- and record from which the data was 
obtained. In this case, the Genbank field "ACCESSION" is 
used to identify the record and separate data (not shown in 
the Table 1) identifies the. Genbank database. 

Turning now to the search database 7, this is also 
. ranged as a number of records, each record defining, a 
,' V! ,.. of synonymous terms ... These terms are obtained from 
cne information databases 2,3,4 and may relate to not only 
,ome synonymous terms within the same database but also 
synonymous terms between different information databases. 
Each record in search database 7, may also define broader 
and/or narrower related terms. Table 2 is an example of 
extracted synonyms from the Genbank record shown in Table 



TABLE 2 



30 



35 



Identifier 


Synonym 


Preferred Term 


012345678 


HSHTR2B2 


HTR2B 


012345678 


HTR2B 


HTR2B 


012345678 


AF156159 


HTR2B 


012345678 


5-hydroxytryptamine 2B 
receptor 


HTR2B 



Each synonym is assigned to a particular group 
identified with a' corresponding group identifier which xs 
internal to the system. Additionally, each group of 
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synonyms has a "preferred" term which typically is the most 
commonly used or most convenient term for explanatory 
purposes. However, whether the actual preferred term is 
used as the inputted search term, does not affect the 
search scope. ' 

Table 3 shows part of a typical record upon the search 
database 7, containing synonyms extracted from the three 
information databases 2,3,4, for example Genbank, Swissprot 
and OMIM. Any degeneracy between the terms extracted from 
these information databases is removed. 



TABLE 3 
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identifier 


Synonym 


Preferred Term 


012345678 


HSHTR2B2 


HTR2B 




HTR2B 






AF156159 






5-hydroxytryptamine 2B 
receptor 






5-HT2B 






5HT2B 






Serotonin 2B receptor 





Referring back to Table 1, it can be seen that each of 
the extracted terms which were assigned to the "SYNONYM" 
meta-data field, are also found within the same record in 
Table 3 (as the first four entries in the "Synonym" 
column) . The use of the meta-data field increases the 
searching speed when a search for synonymous terms is being 
performed within the records of the data repository 5, as 
searching in other fields is not needed. It should be 
remembered that the data repository 5 contains records from 
a number of different information databases 2,3,4 and 
therefore assigning meta-data fields produces this speed 
increase . 

Further information is also present within the records 
of the search database 7, for example, in the case of each 




synonym, an identifier is provided to identify the 
database (s) and in some cases the field (s) in which the 
term is present. Each of the search database records also 
contains a brief textual description of the subject to 
5 which the synonyms relate, such as "Gene that encodes the 
5-hydroxpytryptamine 2B receptor". 

Figure 2 shows a flow diagram of a suitable method for 
use in the database searching system 1. At step 100 in 
Figure 2, a user of the system inputs a search term using 

10 the input means 8. At step 101, other information is also 
provided, for example in that the user selects a number of 
information databases upon which to search for the search 
term and possibly, a limitation to one or more field types 
in which to search for this term. 

15 in the present example, each of the databases 2,3,4 is 

selected and the user chooses all field types for 
searching. At step 102, the query sy**^f*-. 5' analyses the 
input search term and then searr-KM vac;. *;he search 
database 7 for any records contain' :; ~ur.. rrriput search: 

20 terms. This returns one or more "hiive/'p, -Xlat is records 
containing the search term as one of the synonymous terms. 
These records are then retrieved at step 103 and presented 
to the user. 

In some cases, the search term will be present in more 

2 5 than one of the records upon the search database 7. In this 

case, the user can view the textual description attached to 
the record in order to select the type of information 
required. 

Having reviewed the record description, at step 104, 

3 0 the user selects the particular record to which the 

intended search relates. At step 105, the synonymous terms 
held in the selected record of the search database 7 are 
then searched in the required fields of the records held in 
the data repository 5 . Only those fields corresponding to 
3 5 the particular information databases selected by the user 
are searched and the results are then returned to the user 
at step 106. • 




At step 107 a context filtering step is performed 
which analyses the records in order to discard records 
which are not related to the desired search. For example, 
in a case where more than one search database record is 

• 5- initially returned, there will exist at least one -synonym 
(the search term) which is used upon the information 
databases in two different contexts. It is desirable to 
prevent the display of records which do not relate to the 
context of interest. This is achieved by context 

10 filtering. 

The method chosen for this filtering depends upon the 
way in which the information databases are structured. In 
the case of more unstructured databases, for example 
databases of the full text of scientific publications, an 

15 appropriate filtering technique is to search for other 
words relating to the context of interest within the 
records (such as ■•':;iy.g - for the other synonym'sr.' 

none are found t?.- v^ /. ;*--/'.?.rd in question can be assigned 

a low likelihood u: ^^Wance. If desired, this ban be 

2 0 expressed mathemati c^vi:y for filtering and/or presented to 

the user. - 

For more structured information databases such as the 
biomedical science databases used in the present example, 
context filtering can be performed using the "domain" field 
25 as mentioned earlier. ■ Upon construction of the data 
repository 5, the records are . assigned to specific 
"domains" which represent broad topic classes such as DNA, 
disease, and so on. In this case, synonyms in a single 
search database record relate to information database 

3 0 records within a single domain. The search for records 

within the repository 5 can therefore be limited to records 
having the domain common to the synonyms within the group 
of interest. A combination of these and other techniques 
can therefore be performed to effect context filtering. 
3 5 This filtering may be performed following retrieval of all 
of the records as in. the present case, or it may be 
performed "on-the~f ly" . 
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The retrieved and context filtered records from the 
data repository 5 are presented to the user at step 108. 
On selection of a particular record of interest by the 
user, at step 109 the pointer within the particular 
5 • repository record of interest is accessed to discover the 
identity of the corresponding record upon one of the 
information databases 2,3,4. This full record is then 
retrieved from the specific information database and 
displayed to the user at step 110. 

10 The above method can therefore advantageously be used 

to search for related information in databases which use 
different but synonymous terms to describe similar 
information. The selection of the extent to which terms 
are synonymous is at the discretion of the system 

15 administrator. Broader searches can be performed by using 
related rather than synonymous terms. 

though the amount of information searched ; is 
v .j x,.',y in excess of that ; searched using a single 

• the speed and efficiency of the searching is 

2 0 eiguV: *:icantly increased by the use of the data repository 
in which selected record extracts are used for searching 
purposes . 

In the present system, the user is not limited to 
searching using the technique described above as the method 
2 5 can be integrated with other conventional database 
searching tools which access the repository or the 
information databases directly. 
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CLAIMS 



1. 



A method of* searching a plurality of information 



databases for records related to an input search term, 

5 " comprising:-* * * ■ *■ 

selecting a group of related search terms containing 
the input search term, from a search database of terms 
arranged in predefined groups according to their 
relationship with one another, wherein each term is present 
10 within one or more of the information databases; and, 

searching for terms from the selected group within a 
data repository comprising selected data previously 
extracted from the records of each information database, to 
identify the corresponding records within the information 
15 databases which contain the terms within the selected 



20 information databases. . 

3. A method according to claim 2, wherein each record in 
the repository comprises a pointer identifying the record 
in the information database to which it relates. 

4. A method according to any of the preceding claims, 

2 5 wherein the amount of selected data in the repository is 

less than that contained in the information databases. 

5. A method according to any of the preceding claims, 
wherein the data in the repository comprises definitional 
data. 

3 0 6. A method according to claim 5, wherein the definitional 

data describe data in terms of its nature, use or value. 

7. A method according to any of the preceding claims, 
wherein the data in the repository comprises semantic data. 

8. A method according to claim 7, wherein the semantic 
3 5 data describes alternative terms for the data in the 

information database. 



group . 



2. A method according to claim 1, wherein the data 
repository is arranged as a number of records, each .reccrf 
corresponding to a record present within one of" ciife 



4 
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9. A method according to claim 8, wherein the semantic 
data describe synonymous terms in the information 
databases. 

10. A method according to any of the preceding claims, 
5 wherein each term in each predefined group within the 

search database has associated metadata indicating the one 
or more information databases within which the term is 
contained. 

^ 11. A method according to claim 10 , wherein the 
10 corresponding metadata indicates the one or more fields of 

the information database (s) within which it is contained. 

12 . A method according to any of the preceding claims 

wherein a number of records within the data repository are 

assigned to a domain. 
15 13 . A method according to any. of the preceding claims, 

wherein the terms in the predefined groups within the 

search database are synonymous terms. 

14. A method according to any of the :.rt claims, 
wherein each group has an associated gi w- 4" ; 2tier . 

20 15. A method according to claim 13 or clc.i:r ,14, wherein 
each group has associated descriptive data for describing 
the group. 

16. A method according to any of the preceding claims, 
further comprising determining the context of any 

2 5 repository records located. 

17. A method according to claim 16 and when dependent upon 
claim 12 , wherein the context is determined by limiting the 
search to repository records having a common domain. 

18. A method according to claim 16 or claim 17, wherein the 

3 0 -context is determined by searching for the presence of one 

or more of the other terms within the group, in the same 
record of the repository. 

19. A computer program comprising computer program code 
** means adapted to perform the method according to any .of the 

3 5 preceding claims. 

20. A computer program m according to claim 19, embodied 
upon a. computer readable, medium. 



p 




21. A database searching system for searching a plurality 
of information databases for records related to an inputted 
search term, the system comprising 

a search database comprising related search terms 
- 5 arranged into predefined groups according to -their 
relationship to one another, wherein each term is present 
within one or more of the information databases; 

selection means, for selecting a group containing the 
inputted search term from the search database; 
10 a data repository comprising selected data previously 

extracted from the records of each information database ; 
and, 

searching means for searching the repository for terms 
from the selected group to identify the corresponding 
15 records within the information databases which contain the 
terms within the selected group. 

22. A system according- ci; claim 21, wherein further 
comprising an input rr < u.v t * r*. .-^plying the inputted search 
term to the selection .±i> ■* ; 

20 23. A system accordin^V-^o claim' 22, wherein the input 
means comprises a communication network such that the 
inputted search term is received from a remote location. 

24. A system according to any of claims claim 21 to 23, 
further comprising a plurality of information databases 

2 5 from which data is extracted for storage within the data 
repository. 

25. A system according to any of claims 21 to 24, wherein 
the data repository, is stored upon a separate computer 
system with respect to the information databases . 
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